Automatic Identification of Predicate Heads in Chinese Sentences
نویسندگان
چکیده
We propose an effective approach to automatically identify predicate heads in Chinese sentences based on statistical pre-processing and rule-based post-processing. In the preprocessing stage, the maximal noun phrases in a sentence are recognized and replaced by “NP” labels to simplify the sentence structure. Then a CRF model is trained to recognize the predicate heads of this simplified sentence. In the post-processing stage, a rule base is built according to the grammatical features of predicate heads. It is then utilized to correct the preliminary recognition results. Experimental results show that our approach is feasible and effective, and its accuracy achieves 89.14% on Tsinghua Chinese Treebank.
منابع مشابه
Representing Topic-Comment Structures in Chinese
Shi (2000) claims that topics must be related to a syntactic position in the comment, thus denying the existence of dangling topics in Chinese. Under Shi's analysis, the dangling topic sentences in Chinese are not topic-comment but subject-predicate sentences. However, Shi's arguments are not without problems. In this paper we argue that topics in Chinese can be licensed not only by a syntactic...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملChinese Argument Extraction Based on Trigger Mapping
Unlike English, Chinese sentences do not have a strict syntactic structure and ellipsis is a common phenomenon, which weaken the effectiveness of syntactic structure in argument extraction. In Chinese event extraction, lots of arguments cannot be extracted from the sentence successfully, because of the loose connection between the nominal trigger and its arguments. This paper brings forward a n...
متن کاملAn Experimental Study on the Assignment of Focus Accent in Mandarin
This paper investigates the distribution of focus-related accents in the broad focus domain in Chinese Mandarin through 300 natural sentences. The results show that focus –related accent tends to be assigned to the predicate in a subject-predicate structure, to the object in a predicate-object structure, and to the head in an adjunct-head structure unless the head is highly predictable. From th...
متن کاملThree Sensitive Positions and Chinese Complex Sentences: A Comparative Perspective
The positioning of sentential connectives in Chinese complex sentences is more flexible than their counterparts in English. Sentential connectives in Chinese can be placed in three sensitive positions: clause-initial, predicate-initial, and clause-final positions. Due to the co-existence of prepositions and postpositions in the language, sentential connectives can be placed in both clause-initi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010